VLM R1 Qwen2.5VL 3B Math 0305
Apache-2.0
A vision-language model based on Qwen2.5-VL-3B-Instruct, enhanced with mathematical capabilities and trained using VLM-R1 reinforcement learning, specializing in solving math-related visual question answering tasks.
Text-to-Image
Safetensors English